248 ◾ Bioinformatics
HOMER is another motif discovery program that has an algorithm which performs de
novo motif discovery in the regulatory regions of genes using a primary dataset and a sec-
ondary dataset. HOMER has two programs, findMotifs.pl and findMotifsGenome.pl, that
perform the motif discovery in promoter and genomic regions, respectively. However, for
ChIP-Seq enrichment sequences, we will use findMotifs.pl or homer2. Like MEME Suite
programs, HOMER can also create a secondary random dataset if you do not have one.
For the instructions of HOMER installation, visit “http://homer.ucsd.edu/homer/intro-
duction/install.html”. Homer includes a collection of perl and c++ programs designed to
run in a UNIX/Linux environment. Refer to the website for the program requirements.
On Ubuntu, after making sure that all required programs have been installed, you can
create a directory “homer”, download the installation Perl script, and install the HOMER
as follows:
mkdir homer; cd homer
wget http://homer.ucsd.edu/homer/configureHomer.pl
perl configureHomer.pl -install
After the installation is complete, you will get a message as follows:
Add this line to your .bash_profile or .bashrc file (or other depending on your shell):
PATH=$PATH:/home/username/downloads/homer/.//bin/
The path will depend on where you have installed HOMER. Copy the path and add it to the
end of “.bashrc” file as follows:
export PATH=$PATH:/home/username/downloads/homer/.//bin/:$PATH
You may need to restart the terminal or use “source ~/.bashrc” for the change to take effect.
To test HOMER installation, run “findMotifs.pl”. That will display HOMER usage and
option. The “findMotifs.pl” command requires your target sequences’ file and a background
sequence file in FASTA format specified by “-fasta” option. However, if you do not provide
a background sequences’ file, the command will use your target sequences to create back-
ground sequences. The general syntax for the use of “findMotifs.pl” command is as follows:
findMotifs.pl <targetSequences.fa> fasta <output directory> -fasta
<background.fa> [options]
We can use this command without providing background sequences’ file as follows:
findMotifs.pl chip1_peaks.fasta fasta homer_motifs_chip1/
findMotifs.pl chip2_peaks.fasta fasta homer_motifs_chip2/
findMotifs.pl chip3_peaks.fasta fasta homer_motifs_chip3/
Three output directories will be created and several files are produced in each of the output
directories: